Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games

نویسندگان

  • Karel Horák
  • Branislav Bosanský
چکیده

We study two player pursuit-evasion games with concurrent moves, infinite horizon, and discounted rewards. The players have partial observability, however, the evader is given an advantage of knowing the current position of the units of the pursuer. We show that (1) value functions of this game depend only on the position of the pursuing units and the belief the pursuer has about the position of the evader, and (2) that they are piecewise linear convex functions in the belief. Finally, we exploit this representation of value functions and design a value iteration algorithm that approximates the value of the game.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generating strategies for multi-agent pursuit-evasion games in partially observable Euclidean space

We introduce a heuristic search technique for multi-agent pursuit-evasion games in partially observable Euclidean space where a team of tracker agents attempt to minimize their uncertainty about an evasive target agent. Agents’ movement and observation capabilities are restricted by polygonal obstacles, while each agents’ knowledge of the other agents is limited to direct observation or periodi...

متن کامل

Adversarial Planning for Multi-Agent Pursuit-Evasion Games in Partially Observable Euclidean Space

We describe a heuristic search technique for multi-agent pursuit-evasion games in partially observable Euclidean space where a team of trackers attempt to minimize their uncertainty about an evasive target. Agents’ movement and observation capabilities are restricted by polygonal obstacles, while each agent’s knowledge of the other agents is limited to direct observation or periodic updates fro...

متن کامل

Multi - Cumulant and Non - Inferior Strategies for Multi - Player

The paper presents an extension of cost-cumulant control theory over a finite horizon for a class of two-team pursuit-evasion games wherein the evolution of the states of the game in response to decision strategies selected by pursuit and evasion teams from non-inferior sets of admissible controls is described by stochastic linear differential equation and integral quadratic cost. Since the sum...

متن کامل

The Genesis of Differential Games in Light of Isaacs Contributions

Rufus P. Isaacs joined the RAND Corporation4, Santa Monica, California in 1948 and started to develop the theory of dynamic games in the early 1950s. Until winter 1954/55, when Isaacs left the RAND Corporation, he investigated two player, zero-sum dynamic games of the classic pursuit-evasion type. Prior to 1965, Isaacs published his theory only in internal RAND papers and research memoranda. In...

متن کامل

Dynamic Programming for Partially Observable Stochastic Games

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable Markov decision processes (POMDPs) and iterative elimination of dominated strategies in normal form games. We prove that it iteratively eliminates very weakly dominated strategies without first forming the normal form r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017